Modeling Legislation Using Natural Language Processing

ثبت نشده
چکیده

$EVWUDFW This paper describes the possibilities of the translation of legislation, which is written in natural language, into a formal language, i.e. UML/OCL. The tool OPAL (Object-oriented Parsing and Analysis of Legislation) is developed to support the automatic modelling of legislation with the use of appropriate NLP techniques. The aim is not to perform this modelling in a batch fashion from legislation to final model, but interactively in dialogue with the knowledge engineer. The main components of OPAL are a parser (based on a chart-parser algorithm) and a model generator. A special component called modelling interface is added to OPAL to give the knowledge engineer the possibility to keep track of the modelling process and to make adjustments to the final model. 1Introduction The application of legislation is equivalent to judging the consequences of a given situation (the case) within the context of the law. Two things are needed to judge these consequences, the law and a description of the case. The ideal situation would be that a machine performs the judgement of the juridical consequences. Before this is possible, the machine at least must be able to read and interpret the law and the case. If we take a closer look at the law, we see that it is written in natural language, namely Dutch in this research. There are some aspects, in addition to arbitrary texts in natural language, that make legal texts a bit more formal. In (Dutch) legislation for example fixed language constructs are used extensively. Legislation is also organised in a hierarchical structure. The fact that legislation is written in natural language causes some difficulties. The most important problem is that natural language is ambiguous, which entails that an expression in natural language can have multiple meanings. This problem can arise at word level, for example 'bank', and at phrase or sentence level, for example 'the man sees the woman with the telescope'. Another problem that arises is that natural language, and legislation as well, contains vague and unclear notions like 'almost' and 'for the most part'. So before law enforcement can be automated, legislation has to be translated into a language that does not have aforementioned problems and can be read by a computer, for example a specification language. An extra benefit from the translation of legislation into a specification language is that ambiguous constructs in the law can be detected at an early …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparative Linguistics via Language Modeling

Natural languages are related to each other genetically. Traditionally, such genetic relationships have been demonstrated using the comparative method. I describe a software system that implements an alternative method for detecting such relationships using simple statistical language modeling.

متن کامل

Bayesian approaches in Natural Language Processing

This paper overviews Bayesian approaches in natural language processing that are becoming prominent. Without any knowledge of natural language processing, Bayesian approaches to both discriminative learning and generative modeling are described. Especially, näıve bayes and its full unsupervised Bayesian modeling, DM, and LDA are developed. These Bayesian approaches permit interesting joint mode...

متن کامل

A Survey on Statistical Approaches to Natural Language Processing

This survey attempts to catch up with the recent increasing interests in statistical approach to natural language processing based on large corpora. First of all, a historical overview traces back to 1950s when Noam Chomsky proposed his phrase structure transformation grammar and rejected the Markov process natural language modeling. With the development of large corpora and language modeling i...

متن کامل

Using Generalized Language Model for Question Matching

Question and answering service is one of the popular services in the World Wide Web. The main goal of these services is to finding the best answer for user's input question as quick as possible. In order to achieve this aim, most of these use new techniques foe question matching. . We have a lot of question and answering services in Persian web, so it seems that developing a question matching m...

متن کامل

Language Modeling With Dynamic Bayesian Networks Using Conversation Types and Part of Speech Information

In this paper we investigate whether more accurate modeling of differences in language in different types of conversations, e.g. formal presentations vs. spontaneous conversations can improve the quality of a language model. We also investigate whether the modeling of sentence lengths can improve a language model. A language model is an important component of statistical natural language proces...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001